Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
عنوان | 4069 | 177 | 9 | 19.6667 |
اساس | 741 | 34 | 2 | 17.0000 |
معنای | 371 | 12 | 1 | 12.0000 |
فراهم | 363 | 24 | 2 | 12.0000 |
حساب | 228 | 12 | 1 | 12.0000 |
جمله | 868 | 22 | 2 | 11.0000 |
طور | 258 | 22 | 2 | 11.0000 |
صورت | 2382 | 130 | 12 | 10.8333 |
دلیل | 1206 | 70 | 7 | 10.0000 |
ثمر | 118 | 9 | 1 | 9.0000 |
بدل | 75 | 9 | 1 | 9.0000 |
غیره | 154 | 9 | 1 | 9.0000 |
بدین | 231 | 17 | 2 | 8.5000 |
عهده | 411 | 17 | 2 | 8.5000 |
کوچکی | 94 | 8 | 1 | 8.0000 |
سود | 137 | 8 | 1 | 8.0000 |
متهم | 114 | 8 | 1 | 8.0000 |
پیروز | 102 | 8 | 1 | 8.0000 |
۱۳۸۵ | 171 | 8 | 1 | 8.0000 |
تمرکز | 122 | 8 | 1 | 8.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
شد | 10072 | 7 | 366 | 0.0191 |
کرد | 7041 | 9 | 348 | 0.0259 |
شدهاست | 4990 | 5 | 171 | 0.0292 |
است | 21493 | 23 | 712 | 0.0323 |
میشود | 6780 | 11 | 259 | 0.0425 |
کردند | 1418 | 6 | 135 | 0.0444 |
میباشد | 2305 | 4 | 89 | 0.0449 |
میکند | 3594 | 11 | 242 | 0.0455 |
کردهاست | 872 | 4 | 70 | 0.0571 |
شدند | 1287 | 6 | 104 | 0.0577 |
بود | 8856 | 16 | 264 | 0.0606 |
میکنند | 1829 | 9 | 140 | 0.0643 |
دارد | 7541 | 8 | 123 | 0.0650 |
کند | 2270 | 12 | 173 | 0.0694 |
میکرد | 896 | 5 | 70 | 0.0714 |
داشت | 2116 | 7 | 95 | 0.0737 |
میشوند | 1534 | 8 | 107 | 0.0748 |
یافتهاست | 160 | 1 | 13 | 0.0769 |
نشدهاست | 125 | 1 | 13 | 0.0769 |
میلادی، | 384 | 1 | 13 | 0.0769 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II